scatter axis + gather axis primitives #1813

awni · 2025-01-31T00:28:12Z

Add a GatherAxis and ScatterAxis primitive to support take_along_axis and put_along_axis.

The ScatterAxis supports two reduce modes (none and sum). The sum is useful for the gradient of GatherAxis. Did not add more reduce modes to manage complexity. One can always use Scatter for the other modes or we can consider adding them in the future.

Put the kernels in the JIT by default as they are pretty simple but have a lot of combinations.

Incidentally closes #1807

TODO:

transforms
benchmarks

awni · 2025-01-31T16:29:51Z

Other benchmarks:

Benchmark	Pre	Post
Small take	0.635	0.319 msec
Large take	439.15	22.55 msec
Small put	0.569 ms	0.317 msec
Large put	16.33 ms	14.68 msec
DSV3 Expert Score	2.3 msec	0.9 msec

angeloskath

Very nice, very clean. I left only one minor comment on the copy in the CPU side.

I am wondering whether it makes sense for some of these to simply output non-contiguous arrays. Most ops in MLX output contiguous arrays ie we are greedily taking the hit as quickly as possible (with the exception of unary/binary etc). In this case we could always treat one of the two arrays (src or idx) as contiguous and adjust the output order accordingly.

Anyway, I guess it is a case rare enough to not matter so the above can be categorized as a rant.

angeloskath · 2025-01-31T21:01:46Z

mlx/backend/common/indexing.cpp

+  auto& updates = inputs[2];
+
+  // Copy src into out (copy allocates memory for out)
+  copy(src, out, CopyType::General);


Unless I am missing something, this needs to change to something that figures out the copy type. Same goes for normal Scatter. On the GPU side we have that already but I think it makes sense to go to common/copy.h. It would also enable donation of src which would be nice.

Good point, nice low hanging fruit!

angeloskath · 2025-01-31T21:02:03Z

mlx/backend/metal/CMakeLists.txt

@@ -35,6 +35,8 @@ make_jit_source(ternary_ops)
 make_jit_source(reduce_utils kernels/atomic.h kernels/reduction/ops.h)
 make_jit_source(scatter kernels/indexing.h)
 make_jit_source(gather kernels/indexing.h)
+make_jit_source(gather_axis)
+make_jit_source(scatter_axis)


angeloskath · 2025-01-31T21:50:10Z

mlx/backend/metal/indexing.cpp

+  kernel_name += upd.flags().row_contiguous ? "c" : "nc";
+  kernel_name += idx.flags().row_contiguous ? "c" : "nc";
+
+  auto lib = d.get_library(lib_name, [&]() {


100% not against it but are we moving towards having things inline unless they are used in multiple places in which case we move them to kernels.h? Or is it only for things that are always jitted?

Indeed, right now the pattern is inline if it's always jitted and in kernels.h otherwise (since jit_kernels.cpp only gets included when JIT is enabled).

angeloskath · 2025-01-31T21:58:41Z

mlx/ops.cpp

+  }
+
+  lhs_indices = astype(lhs_indices, uint32, s);
+  rhs_indices = astype(rhs_indices, uint32, s);


scatter axis + gather axis primitives

6fb1fff

awni force-pushed the gather_scatter_axis branch from c273d38 to 6fb1fff Compare January 31, 2025 00:52

add transforms

aadf66e

awni force-pushed the gather_scatter_axis branch from a9cb2c1 to aadf66e Compare January 31, 2025 16:51

awni marked this pull request as ready for review January 31, 2025 16:51

awni requested review from angeloskath and barronalex January 31, 2025 16:51

awni mentioned this pull request Jan 31, 2025

Compile and use put along axis in deep seek routing function ml-explore/mlx-examples#1234

Closed

angeloskath approved these changes Jan 31, 2025

View reviewed changes

awni force-pushed the gather_scatter_axis branch from 93dbfc9 to eb9a9d5 Compare January 31, 2025 22:44

comment

199baf0

awni force-pushed the gather_scatter_axis branch from eb9a9d5 to 199baf0 Compare February 1, 2025 00:02

awni merged commit b7c9f1d into main Feb 1, 2025
5 checks passed

awni deleted the gather_scatter_axis branch February 1, 2025 04:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scatter axis + gather axis primitives #1813

scatter axis + gather axis primitives #1813

awni commented Jan 31, 2025 •

edited

Loading

awni commented Jan 31, 2025 •

edited

Loading

angeloskath left a comment

angeloskath Jan 31, 2025

awni Jan 31, 2025

angeloskath Jan 31, 2025

angeloskath Jan 31, 2025

awni Jan 31, 2025

angeloskath Jan 31, 2025

scatter axis + gather axis primitives #1813

scatter axis + gather axis primitives #1813

Conversation

awni commented Jan 31, 2025 • edited Loading

awni commented Jan 31, 2025 • edited Loading

angeloskath left a comment

Choose a reason for hiding this comment

angeloskath Jan 31, 2025

Choose a reason for hiding this comment

awni Jan 31, 2025

Choose a reason for hiding this comment

angeloskath Jan 31, 2025

Choose a reason for hiding this comment

angeloskath Jan 31, 2025

Choose a reason for hiding this comment

awni Jan 31, 2025

Choose a reason for hiding this comment

angeloskath Jan 31, 2025

Choose a reason for hiding this comment

awni commented Jan 31, 2025 •

edited

Loading

awni commented Jan 31, 2025 •

edited

Loading